Curating the CIA World Factbook

نویسندگان

  • Peter Buneman
  • Heiko Müller
  • Chris Rusbridge
چکیده

The CIA World Factbook is a prime example of a curated database – a database that is constructed and maintained with a great deal of human effort in collecting, verifying, and annotating data. Preservation of old versions of the Factbook is important for verification of citations; it is also essential for anyone interested in the history of the data such as demographic change. Although the Factbook has been published, both physically and electronically, only for the past 30 years, we appear in danger of losing this history. This paper investigates the issues involved in capturing the history of an evolving database and its application to the CIA World Factbook. In particular it shows that there is substantial added value to be gained by preserving databases in such a way that questions about the change in data, (longitudinal queries) can be readily answered. Within this paper, we describe techniques for recording change in a curated database and we describe novel techniques for querying the change. Using the example of this archived curated database, we discuss the extent to which the accepted practices and terminology of archiving, curation and digital preservation apply to this important class of digital artefacts. 1 This paper is based on the paper given by the authors at the 5th International Digital Curation Conference, December 2009; received November 2009, published December 2009. The International Journal of Digital Curation is an international journal committed to scholarly excellence and dedicated to the advancement of digital curation across a wide range of sectors. ISSN: 1746-8256 The IJDC is published by UKOLN at the University of Bath and is a publication of the Digital Curation Centre. 30 Curating the CIA World Factbook

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploratory Analysis of CIA Factbook Data Using Kohonen Self-Organizing Maps

A visual country comparison is of great importance to research and practice. The method of Kohonen Self-Organizing Maps (SOM) is able to present the data in a visual map and at the same time tries to maintain the topological features of the data. We employ SOM and use data from the 2007 Central Intelligence Agency (CIA) World Factbook to identify what patterns exist between selected countries a...

متن کامل

Concepts, Technology, and Applications in E-mentoring

The so-called “Internet revolution” has dramatically changed the way people communicate and work nowadays. Attending to The Word Factbook developed by the U.S. Central Intelligence Agency (CIA), there are 1,018,057,389 Internet users in the world by 2005 (CIA, 2006). Fostering of the Internet revolution from a business perspective is out of question and the ever-growing number of Web functional...

متن کامل

IR and AI: Using co-occurrence Theory to Generate Lightweight Ontologies

This paper illustrated the application of cooccurrence theory to generate lightweight ontologies semi-automatically. First, the relationship of Information Retrieval (IR) and Artificial Intelligence (AI) is discussed in a general way. Then two case studies have been conducted to generate lightweight ontologies in specific domains (Information Retrieval domain and European part of CIA FactBook)....

متن کامل

Genetics and Genomic Medicine in Colombia

Colombia is a country located in the northwest corner of South America (Fig. 1). Initially founded in 1717 as the Viceroyalty of New Grenada, it underwent many transitions in its government and territory after winning its independence from Spain in 1819, finally becoming the Republic of Colombia in 1886 (CIA, World Fact Book: https://www.cia.gov/library/publications/the-world-factbook/geos/co.h...

متن کامل

Estimating the Religious Composition of All Nations: An Empirical Assessment of the World Christian Database

The international religious data in the World Christian Database (WCD), and its print predecessor, the World Christian Encyclopedia (WCE) have been used frequently in academic studies and the popular press. Scholars have raised questions about the WCD’s estimates categories, and potential bias, but the data have not yet been systematically assessed. We test the reliability of the WCD by compari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJDC

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2009